Picture for Shu Yao

Shu Yao

PRISM: A Benchmark for Programmatic Spatial-Temporal Reasoning

Add code
May 19, 2026
Viaarxiv icon

Towards a Science of Collective AI: LLM-based Multi-Agent Systems Need a Transition from Blind Trial-and-Error to Rigorous Science

Add code
Feb 05, 2026
Viaarxiv icon